RAW SEQUENCE LISTING DATE: 05/22/2001 

PATENT APPLICATION: US/09/840,746 TIME: 15:24:25 

Input Set : A:\Pto.amc 

Output Set: C:\CRF3\05222001\l840746.raw 

2 <110> APPLICANT: Chen, Huei-Mei 

3 Honchell, Cynthia D. 

4 Tang, Y. Tom 

6 <120> TITLE OF INVENTION: Mucin-Related Tumor Marker 

8 <130> FILE REFERENCE: PC-0039 US 
C--> 10 <140> CURRENT APPLICATION NUMBER: US/09/840,746 
C--> 11 <141> CURRENT FILING DATE: 2001-04-23 

13 <160> NUMBER OF SEQ ID NOS : 20 

14 <170> SOFTWARE: PERL Program 

16 <210> SEQ ID NO: 1 

17 <211> LENGTH: 946 

18 <212> TYPE: PRT 

19 <213> ORGANISM: Homo sapiens 

21 <220> FEATURE: 

22 <221> NAME/KEY: misc_feature 

23 <223> OTHER INFORMATION: Incyte ID No: 182514CD1 
25 <400> SEQUENCE: 1 



26 Met Ser Gln Thr Glu Thr Val Ser Arg Ser Val Ala Pro Met Arg 

27 1 5 10 15 

28 Gly Gly Glu Ile Thr Ala His Trp Leu Leu Thr Asn Ser Thr Thr 

29 20 25 30 

30 Ser Ala Asp Val Thr Gly Ser Ser Ala Ser Tyr Pro Glu Gly Val 

31 35 40 45 

32 Asn Ala Ser Val Leu Thr Gln Phe Ser Asp Ser Thr Val Gln Ser 

33 50 55 60 

34 Gly Gly Ser His Thr Ala Leu Gly Asp Arg Ser Tyr Ser Glu Ser 

35 65 70 75 

36 Ser Ser Thr Ser Ser Ser Glu Ser Leu Asn Ser Ser Ala Pro Arg 

37 80 85 90 

38 Gly Glu Arg Ser Ile Ala Gly Ile Ser Tyr Gly Gln Val Arg Gly 

39 95 100 105 

40 Thr Ala Ile Glu Gln Arg Thr Ser Ser Asp His Thr Asp His Thr 

41 110 115 120 

42 Tyr Leu Ser Ser Thr Phe Thr Lys Gly Glu Arg Ala Leu Leu Ser 

43 125 130 135 

44 Ile Thr Asp Asn Ser Ser Ser Ser Asp Ile Val Glu Ser Ser Thr 

45 140 145 150 

46 Ser Tyr Ile Lys Ile Ser Asn Ser Ser His Ser Glu Tyr Ser Ser 

47 155 160 165 

48 Phe Ser His Ala Gln Thr Glu Arg Ser Asn Ile Ser Ser Tyr Asp 

49 170 175 180 

50 Gly Glu Tyr Ala Gln Pro Ser Thr Glu Ser Pro Val Leu His Thr 

51 185 190 195 

52 Ser Asn Leu Pro Ser Tyr Thr Pro Thr Ile Asn Met Pro Asn Thr 

53 200 205 210 

54 Ser Val Val Leu Asp Thr Asp Ala Glu Phe Val Ser Asp Ser Ser 

55 215 220 225 

56 Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Gly Pro Pro 

57 230 235 240 

58 Leu Pro Leu Pro Ser Val Ser Gln Ser His His Leu Phe Ser Ser 

59 245 250 255 

60 Ile Leu Pro Ser Thr Arg Ala Ser Val His Leu Leu Lys Ser Thr 

61 260 265 270 

62 Ser Asp Ala Ser Thr Pro Trp Ser Ser Ser Pro Ser Pro Leu Pro 

63 275 280 285 

64 Val Ser Leu Thr Thr Ser Thr Ser Ala Pro Leu Ser Val Ser Gln 

65 290 295 300 

66 Thr Thr Leu Pro Gln Ser Ser Ser Thr Pro Val Leu Pro Arg Ala 

67 305 310 315 

68 Arg Glu Thr Pro Val Thr Ser Phe Gln Thr Ser Thr Met Thr Ser 

69 320 325 330 

70 Phe Met Thr Met Leu His Ser Ser Gln Thr Ala Asp Leu Lys Ser 

71 335 340 345 

72 Gln Ser Thr Pro His Gln Glu Lys Val Ile Thr Glu Ser Lys Ser 

73 350 355 360 

74 Pro Ser Leu Val Ser Leu Pro Thr Glu Ser Thr Lys Ala Val Thr 

75 365 370 375 

76 Thr Asn Ser Pro Leu Pro Pro Ser Leu Thr Glu Ser Ser Thr Glu 

77 380 385 390 

78 Gln Thr Leu Pro Ala Thr Ser Thr Asn Leu Ala Gln Met Ser Pro 

79 395 400 405 

80 Thr Phe Thr Thr Thr Ile Leu Lys Thr Ser Gln Pro Leu Met Thr 

81 410 415 420 

82 Thr Pro Gly Thr Leu Ser Ser Thr Ala Ser Leu Val Thr Gly Pro 

83 425 430 435 

84 Ile Ala Val Gln Thr Thr Ala Gly Lys Gln Leu Ser Leu Thr His 

85 440 445 450 

86 Pro Glu Ile Leu Val Pro Gln Ile Ser Thr Glu Gly Gly Ile Ser 

87 455 460 465 

88 Thr Glu Arg Asn Arg Val Ile Val Asp Ala Thr Thr Gly Leu Ile 

89 470 475 480 

90 Pro Leu Thr Ser Val Pro Thr Ser Ala Lys Glu Met Thr Thr Lys 

91 485 490 495 

92 Leu Gly Val Thr Ala Glu Tyr Ser Pro Ala Ser Arg Ser Leu Gly 

93 500 505 510 

94 Thr Ser Pro Ser Pro Gln Thr Thr Val Val Ser Thr Ala Glu Asp 

95 515 520 525 

96 Leu Ala Pro Lys Ser Ala Thr Phe Ala Val Gln Ser Ser Thr Gln 

97 530 535 540 

98 Ser Pro Thr Thr Leu Ser Ser Ser Ala Ser Val Asn Ser Cys Ala 

99 545 550 555 

100 Val Asn Pro Cys Leu His Asn Gly Glu Cys Val Ala Asp Asn Thr 

101 560 565 570 

102 Ser Arg Gly Tyr His Cys Arg Cys Pro Pro Ser Trp Gln Gly Asp 

103 575 580 585 

104 Asp Cys Ser Val Asp Val Asn Glu Cys Leu Ser Asn Pro Cys Pro 

105 590 595 600 

106 Ser Thr Ala Thr Cys Asn Asn Thr Gln Gly Ser Phe Ile Cys Lys 

107 605 610 615 

108 Cys Pro Val Gly Tyr Gln Leu Glu Lys Gly Ile Cys Asn Leu Val 

109 620 625 630 

110 Arg Thr Phe Val Thr Glu Phe Lys Leu Lys Arg Thr Phe Leu Asn 

111 635 640 645 

112 Thr Thr Val Glu Lys His Ser Asp Leu Gln Glu Val Glu Asn Glu 

113 650 655 660 

114 Ile Thr Lys Thr Leu Asn Met Cys Phe Ser Ala Leu Pro Ser Tyr 

115 665 670 675 

116 Ile Arg Ser Thr Val His Ala Ser Arg Glu Ser Asn Ala Val Val 

117 680 685 690 

118 Ile Ser Leu Gln Thr Thr Phe Ser Leu Ala Ser Asn Val Thr Leu 

119 695 700 705 

120 Phe Asp Leu Ala Asp Arg Met Gln Lys Cys Val Asn Ser Cys Lys 

121 710 715 720 

122 Ser Ser Ala Glu Val Cys Gln Leu Leu Gly Ser Gln Arg Arg Ile 

123 725 730 735 

124 Phe Arg Ala Gly Ser Leu Cys Lys Arg Lys Ser Pro Glu Cys Asp 

125 740 745 750 

126 Lys Asp Thr Ser Ile Cys Thr Asp Leu Asp Gly Val Ala Leu Cys 

127 755 760 765 

128 Gln Cys Lys Ser Gly Tyr Phe Gln Phe Asn Lys Met Asp His Ser 

129 770 775 780 

130 Cys Arg Ala Cys Glu Asp Gly Tyr Arg Leu Glu Asn Glu Thr Cys 

131 785 790 795 

132 Met Ser Cys Pro Phe Gly Leu Gly Gly Leu Asn Cys Gly Asn Pro 

133 800 805 810 

134 Tyr Gln Leu Ile Thr Val Val Ile Ala Ala Ala Gly Gly Gly Leu 

135 815 820 825 

136 Leu Leu Ile Leu Gly Ile Ala Leu Ile Val Thr Cys Cys Arg Lys 

137 830 835 840 

138 Asn Lys Asn Asp Ile Ser Lys Leu Ile Phe Lys Ser Gly Asp Phe 

139 845 850 855 

140 Gln Met Ser Pro Tyr Ala Glu Tyr Pro Lys Asn Pro Arg Ser Gln 

141 860 865 870 

142 Glu Trp Gly Arg Glu Ala Ile Glu Met His Glu Asn Gly Ser Thr 

143 875 880 885 

144 Lys Asn Leu Leu Gln Met Thr Asp Val Tyr Tyr Ser Pro Thr Ser 

145 890 895 900 

146 Val Arg Asn Pro Glu Leu Glu Arg Asn Gly Leu Tyr Pro Ala Tyr 

147 905 910 915 

148 Thr Gly Leu Pro Gly Ser Arg His Ser Cys Ile Phe Pro Gly Gln 

149 920 925 930 

150 Tyr Asn Pro Ser Phe Ile Ser Asp Glu Ser Arg Arg Arg Asp Tyr 

151 935 940 945 

152 Phe 



155 <210> SEQ ID NO: 2 

156 <211> LENGTH: 6952 

157 <212> TYPE: DNA 

158 <213> ORGANISM: Homo sapiens 

160 <220> FEATURE: 

161 <221> NAME/KEY: misc_feature 

162 <223> OTHER INFORMATION: Incyte ID No: 182514CB1 

164 <400> SEQUENCE: 2 

165 gttcgatgaa agaattgccg cttttcaaac aaagagtgga acagcctcgg agatgggaac 60 

166 agagagggcg atggggctgt cagaagaatg gactgtgcac agccaagagg ccaccacttc 120 

167 ggcttggagc ccttcctttc ttcctgcttt ggagatggga gagctgacca cgccttctag 180 

168 gaagagaaat tcctcaggac cagatctctc ctggctgcat ttctacagga cagcagcttc 240 

169 ctctcctctc ttagaccttt cctcaccttc tgaaagtaca gagaagctta acaactccac 300 

170 tggcctccag agctcctcag tcagtcaaac aaagacaatg catgttgcta ccgtgttcac 360 

171 tgatggtggc ccgagaacgc tgcgatcttt gacggtcagt ctgggacctg tgagcaagac 420 

172 agaaggcttc cccaaggact ccagaattgc cacgacttca tcctcagtcc ttctttcacc 480 

173 ctctgcagtg gaatcgagaa gaaacagtag agtaactggg aatccagggg atgaggaatt 540 

174 cattgaacca tccacagaaa atgaatttgg acttacgtct ttgcgtggca aaatgattcc 600 

175 ccaacctttg gagaacatca gcttgccagc agctctgagg tgcaaaatgg aagtcccatg 660 

176 tctcagactg agactgtgtc taggtcagtc gcacccatga gaggtggaga gatcactgca 720 

177 cactggctct tgaccaacag cacaacatct gcagatgtga caggaagctc tgcttcatat 780 

178 cctgaaggtg tgaatgcttc agtgttgacc cagttctcag actctactgt acagtctgga 840 

179 ggaagtcaca cagcattggg agataggagt tattcagagt cttcatctac atcttcctcg 900 

180 gaaagcttga attcatcagc accacgtgga gaacgttcaa tcgctgggat tagctacggt 960 

181 caagtgcgtg gcacagctat tgaacaaagg acttccagcg accacacaga ccacacctac 1020 

182 ctgtcatcta ctttcaccaa aggagaacgg gcgttactgt ccattacaga taacagttca 1080 

183 tcctcagaca ttgtggagag ctcaacttct tatattaaaa tctcaaactc ttcacattca 1140 

184 gagtattcct ccttttctca tgctcagact gagagaagta acatctcatc ctatgacggg 1200 

185 gaatatgctc agccttctac tgagtcgcca gttctgcata catccaacct tccgtcctac 1260 

186 acacccacca ttaatatgcc gaacacttcg gttgttctgg acactgatgc tgagtttgtt 1320 

187 agtgactcct cctcctcctc ttcctcctcc tcctcttctt cttcttcagg gcctcctttg 1380 

188 cctctgccct ctgtgtcaca atcccaccat ttattttcat caattttacc atcaaccagg 1440 

189 gcctctgtgc atctactaaa gtctacctct gatgcatcca caccatggtc ttcctcacca 1500 

190 tcacctttac cagtatcctt aacgacatct acatctgccc cactttctgt ctcacaaaca 1560 

191 accttgccac agtcatcttc tacccctgtc ctgcccaggg caagggagac tcctgtgact 1620 

192 tcatttcaga catcaacaat gacatcattc atgacaatgc tccatagtag tcaaactgca 1680 

193 gaccttaaga gccagagcac cccacaccaa gagaaagtca ttacagaatc aaagtcacca 1740 

194 agcctggtgt ctctgcccac agagtccacc aaagctgtaa caacaaactc tcctttgcct 1800 

195 ccatccttaa cagagtcctc cacagagcaa acccttccag ccacaagcac caacttagca 1860 

196 caaatgtctc caactttcac aactaccatt ctgaagacct ctcagcctct tatgaccact 1920 

197 cctggcaccc tgtcaagcac agcatctctg gtcactggcc ctatagccgt acagactaca 1980 

198 gctggaaaac agctctcgct gacccatcct gaaatactag ttcctcaaat ctcaacagaa 2040 

199 ggtggcatca gcacagaaag gaaccgagtg attgtggatg ctaccactgg attgatccct 2100 

200 ttgaccagtg tacccacatc agcaaaagaa atgaccacaa agcttggcgt tacagcagag 2160 

201 tacagcccag cttcacgttc cctcggaaca tctccttctc cccaaaccac agttgtttcc 2220 

202 acggctgaag acttggctcc caaatctgcc acctttgctg ttcagagcag cacacagtca 2280 

203 ccaacaacac tgtcctcttc agcctcagtc aacagctgtg ctgtgaaccc ttgtcttcac 2340 

204 aatggcgaat gcgtcgcaga caacaccagc cgtggctacc actgcaggtg cccgccttcc 2400 

205 tggcaagggg atgattgcag tgtggatgtg aatgagtgcc tgtcgaaccc ctgcccatcc 2460 

206 acagccacgt gcaacaatac tcagggatcc tttatctgca aatgcccggt tgggtaccag 2520 

207 ttggaaaaag ggatatgcaa tttggttaga accttcgtga cagagtttaa attaaagaga 2580 

208 acttttctta atacaactgt ggaaaaacat tcagacctac aagaagttga aaatgagatc 2640 

209 accaaaacgt taaatatgtg tttttcagcg ttacctagtt acatccgatc tacagttcac 2700 

210 gcctctaggg agtccaacgc ggtggtgatc tcactgcaaa caaccttttc cctggcctcc 2760 

211 aatgtgacgc tatttgacct ggctgatagg atgcagaaat gtgtcaactc ctgcaagtcc 2820 

212 tctgctgagg tctgccagct cttgggatct cagaggcgga tctttagagc gggcagcttg 2880 

213 tgcaagcgga agagtcccga atgtgacaaa gacacctcca tctgcactga cctggacggc 2940 

214 gttgccctgt gccagtgcaa gtcgggatac tttcagttca acaagatgga ccactcctgc 3000 

215 cgagcatgtg aagatggata taggcttgaa aatgaaacct gcatgagttg cccatttggc 3060 

216 cttggtggtc tcaactgtgg aaacccctat cagcttatca ctgtggtgat cgcagccgcg 3120 

217 ggaggtgggc tcctgctcat cctaggcatc gcactgattg ttacctgttg cagaaagaat 3180 

218 aaaaatgaca taagcaaact catcttcaaa agtggagatt tccaaatgtc cccatatgct 3240 

219 gaatacccca aaaatcctcg ctcacaagaa tggggccgag aagctattga aatgcatgag 3300 

220 aatggaagta ccaaaaacct cctccagatg acggatgtgt actactcgcc tacaagtgta 3360 

221 aggaatccag aacttgaacg aaacggactc tacccggcct acactggact gccaggatca 3420 

222 cggcattctt gcattttccc cggacagtat aacccgtctt tcatcagtga tgaaagcaga 3480 

223 agaagagact acttttaagt ccaggagaga gagggactca ttgctctgag ccagtcacct 3540 

224 gggacctctg ctcagaggac cgcaccagga ggctgcgccc aggatttgtc gggagccacg 3600 

225 ctgagtggca agcaggaaga gggacaggca tgcggggcgt gaccacagtg gaggagacag 3660 

226 gtggatgtgg aaccacaggc tgctcattca gcacctttgt tgttactgtg aacgtgaatg 3720 

227 tgggccagta tcaagagagt ctctctgagt gactgcacca tggcactggc accagggcga 3780 

228 ctattagcca gggcagacca ctagacttca gtgcagggac ctggttttcc cttcgtttgc 3840 

229 actttagtaa attgggtggg aggtttcctt ttggatctgt tttgagactg ttccagaaag 3900 

230 aaggcttcct ttcccgagac acttccatag gcagcaattt ggtgattcat ttgcagcaaa 3960 

231 atactggctt gttaattatt ttcctgccca gcgcctgcgt gctaaacaac agatgaggat 4020 

232 gagcgtacca ctgaagtctg aagatgtcgc cattgaacgg acagtgtttt catatgtttc 4080 

233 taggttgtct tatgctacag tttccaagcc agcccccaca gtgaggaaat gtgtgaggca 4140 

234 ccgcacacaa ctgcaatgtg ttttttaagt caaggtgaca catgtattta agattttttt 4200 

235 ttaaaatctc tttgcagtta aatctcactt tttcaaacaa gcctggatca gggcaaaaca 4260 

236 acttatattt ggttttagct ggaggctcag caggcagatt gcaggcaggg gggcactttt 4320 

237 catccatgag ggcccagcct ggggcctggg actctgatca ccattgtgga ggccagaggc 4380 

238 agctgcgtat ggaggagaaa tgtcaaactg aacgcaggtt tcaccactct aggaaagcag 4440 

239 cttgttgagc ccctgcagct ggatgtggtt agagggatgg gctgaatagg caggttagat 4500 

240 ttcctgcatc aacagtgctt tgggaagctg tgtggattcc tgaggaagaa cagggagccg 4560 

241 agatggagcc acacatgagt ttgctcaccg gctactgcag cactttgtac ccagaatctc 4620 

242 atgtccacaa accccatgta aactttcaac cactcaaagc tgtttattcg gctgaagaaa 4680 

243 taactttttt ttctcaccca gtcatttgta cctcttcata tggctgtgtc gcaccctcca 4740 

244 gaaacgtggt tatacttcca gtcagtgtgg gagaactgaa gacttccggt tggtcgagga 4800 

245 actgagggtt gaccttcggg aaggaagttc cactcatctt atttattatg cctgtgatgt 4860 

246 gggtcctgcc agggagacat ccagtactcg gtgtctttaa ttgccacctg gggaactgtg 4920 

247 tttattggcc ttctttgggg catcctggtt ttggatgaag tgaggggaat acagaggtaa 4980 

248 aagaattgtc tccaccctga agcggggagt cccgcttcac atttctggaa atggtgcagc 5040 

249 cactggggac agttctgccc cgggcatggt tgtttcttca aggtcctcta aatataatcc 5100 

250 ctattcttac ataatccttg gccctgatgg ttttaagcaa gaactcctgt gtcccatggt 5160 

251 ctccaccact caccatcacc ctgctgtagc aagagtccta gtcaggggag gtgcatttta 5220 

252 gtagttaaat tgcacttatc catgagataa ataaaaggag aactgttttt atcagtggag 5280 

253 gctaacctaa aatttcaaag tgtcgccttt ttgaaatctt gggcctctct ctctgtagaa 5340 

254 ccaatggccc tttgtggctc acggcctcgc acctaactgg agagttctga gctcctgcag 5400 

255 ctcacctgag cccacagact aggcttcttg gctccttccg cagcatgcct gctcaccccc 5460 



Please Note: 

Use of n and/or Xaa has been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> to 
<223> fields of each sequence which presents at least one n or Xaa. 
VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/840,746 

DATE: 05/22/2001 
TIME: 15:24:26 

Input Set : A:\Pto.amc 

Output Set: C:\CRF3\05222001\l840746.raw 

10 M:270 C: Current Application Number differs, Replaced Current Application Number 
11 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
558 M:341 (46) "n" or "Xaa" used, for SEQ ID#:15 
559 M:341 (46) "n" or "Xaa" used, for SEQ ID#:15 
578 M:341 (46) "n" or "Xaa" used, for SEQ ID#:16 
579 M:341 (46) "n" or "Xaa" used, for SEQ ID#:16 
580 M:341 (46) "n" or "Xaa" used, for SEQ ID#:16 
581 M:341 (46) "n" or "Xaa" used, for SEQ ID#:16 
600 M:341 (46) "n" or "Xaa" used, for SEQ ID#:17 
602 M:341 (46) "n" or "Xaa" used, for SEQ ID#:17 